Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 23921 |
| Missing cells | 126580 |
| Missing cells (%) | 18.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.0 MiB |
| Average record size in memory | 217.0 B |
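The two derived figures in this table can be checked from the raw counts. The following is a minimal sketch of that arithmetic. It assumes missing cells are counted over every cell (rows × columns) and that total memory is roughly the average record size times the number of rows; the function names are illustrative and not part of pandas-profiling.

```python
def missing_cell_share(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Fraction of all cells (rows x columns) that are missing."""
    return n_missing / (n_rows * n_cols)


def total_size_mib(avg_record_bytes: float, n_rows: int) -> float:
    """Total in-memory size in MiB implied by the average record size."""
    return avg_record_bytes * n_rows / 2**20


# Figures taken from the "Dataset statistics" table.
share = missing_cell_share(n_missing=126580, n_rows=23921, n_cols=28)
size = total_size_mib(avg_record_bytes=217.0, n_rows=23921)

print(f"{share:.1%}")     # 18.9%
print(f"{size:.1f} MiB")  # 5.0 MiB
```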
Variable types
| Categorical | 9 |
|---|---|
| Numeric | 17 |
| Boolean | 2 |
Alerts
| Alert | Type |
|---|---|
| date has a high cardinality: 5550 distinct values | High cardinality |
| home_team has a high cardinality: 211 distinct values | High cardinality |
| away_team has a high cardinality: 211 distinct values | High cardinality |
| tournament has a high cardinality: 82 distinct values | High cardinality |
| city has a high cardinality: 1576 distinct values | High cardinality |
| country has a high cardinality: 217 distinct values | High cardinality |
| home_team_fifa_rank is highly correlated with away_team_fifa_rank and 6 other fields | High correlation |
| away_team_fifa_rank is highly correlated with home_team_fifa_rank and 6 other fields | High correlation |
| home_team_total_fifa_points is highly correlated with year and 6 other fields | High correlation |
| away_team_total_fifa_points is highly correlated with year and 7 other fields | High correlation |
| home_team_goalkeeper_score is highly correlated with home_team_fifa_rank and 4 other fields | High correlation |
| away_team_goalkeeper_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| home_team_mean_defense_score is highly correlated with home_team_fifa_rank and 4 other fields | High correlation |
| away_team_mean_defense_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| home_team_mean_midfield_score is highly correlated with home_team_fifa_rank and 5 other fields | High correlation |
| away_team_mean_midfield_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| home_team_mean_offense_score is highly correlated with home_team_fifa_rank and 4 other fields | High correlation |
| away_team_mean_offense_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| away_team_continent is highly correlated with home_team_continent and 1 other field | High correlation |
| tournament is highly correlated with year and 8 other fields | High correlation |
| home_team_continent is highly correlated with away_team_continent and 1 other field | High correlation |
| neutral_location is highly correlated with tournament | High correlation |
| year is highly correlated with home_team_total_fifa_points and 2 other fields | High correlation |
| month is highly correlated with day and 1 other field | High correlation |
| day is highly correlated with month | High correlation |
| away_team_score is highly correlated with home_team_result | High correlation |
| home_team_result is highly correlated with away_team_score | High correlation |
| home_team_goalkeeper_score has 15542 (65.0%) missing values | Missing |
| away_team_goalkeeper_score has 15826 (66.2%) missing values | Missing |
| home_team_mean_defense_score has 16134 (67.4%) missing values | Missing |
| away_team_mean_defense_score has 16357 (68.4%) missing values | Missing |
| home_team_mean_midfield_score has 15759 (65.9%) missing values | Missing |
| away_team_mean_midfield_score has 15942 (66.6%) missing values | Missing |
| home_team_mean_offense_score has 15411 (64.4%) missing values | Missing |
| away_team_mean_offense_score has 15609 (65.3%) missing values | Missing |
| home_team_total_fifa_points has 14290 (59.7%) zeros | Zeros |
| away_team_total_fifa_points has 14288 (59.7%) zeros | Zeros |
| home_team_score has 6273 (26.2%) zeros | Zeros |
| away_team_score has 9558 (40.0%) zeros | Zeros |
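Alerts of the kind listed above can be thought of as simple threshold checks on per-column summaries. The sketch below illustrates the idea only: the `ColumnSummary` class, the `alerts_for` function, and all three thresholds are hypothetical and are not taken from the profiling configuration.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical thresholds, chosen only for illustration.
CARDINALITY_THRESHOLD = 50   # distinct values in a categorical column
MISSING_THRESHOLD = 0.05     # share of missing values
ZEROS_THRESHOLD = 0.05       # share of zeros


@dataclass
class ColumnSummary:
    name: str
    n: int                 # number of observations
    n_distinct: int
    n_missing: int = 0
    n_zeros: int = 0
    categorical: bool = False


def alerts_for(col: ColumnSummary) -> List[str]:
    """Return alert messages worded like the ones in the report."""
    found = []
    if col.categorical and col.n_distinct > CARDINALITY_THRESHOLD:
        found.append(
            f"{col.name} has a high cardinality: {col.n_distinct} distinct values"
        )
    p_missing = col.n_missing / col.n
    if p_missing > MISSING_THRESHOLD:
        found.append(f"{col.name} has {col.n_missing} ({p_missing:.1%}) missing values")
    p_zeros = col.n_zeros / col.n
    if p_zeros > ZEROS_THRESHOLD:
        found.append(f"{col.name} has {col.n_zeros} ({p_zeros:.1%}) zeros")
    return found


# Counts taken from the report.
date = ColumnSummary("date", n=23921, n_distinct=5550, categorical=True)
score = ColumnSummary("home_team_score", n=23921, n_distinct=21, n_zeros=6273)
print(alerts_for(date))
print(alerts_for(score))
```

Correlation alerts need pairwise statistics between columns and are left out of this sketch.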
Reproduction
| Analysis started | 2022-10-09 19:06:36.649812 |
|---|---|
| Analysis finished | 2022-10-09 19:07:31.051401 |
| Duration | 54.4 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
Variables
date (Categorical)
| Distinct | 5550 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| 2012-02-29 | 66 |
|---|---|
| 2016-03-29 | 59 |
| 2008-03-26 | 59 |
| 2014-03-05 | 57 |
| 2022-03-29 | 55 |
| Other values (5545) | 23625 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 239210 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1934 |
|---|---|
| Unique (%) | 8.1% |
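"Distinct" and "Unique" measure different things here: 5550 distinct dates appear in the column, but only 1934 of them occur exactly once. A minimal sketch of the distinction, using a made-up four-row sample (the helper name `distinct_and_unique` is illustrative):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Illustrative sample, not taken from the dataset.
sample = ["1993-08-08", "1993-08-08", "1993-08-09", "1993-08-11"]
n_distinct, n_unique = distinct_and_unique(sample)
print(n_distinct, n_unique)  # 3 2

# The report's percentages are both relative to the 23921 observations.
print(f"{5550 / 23921:.1%}")  # 23.2%
print(f"{1934 / 23921:.1%}")  # 8.1%
```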
Sample
| 1st row | 1993-08-08 |
|---|---|
| 2nd row | 1993-08-08 |
| 3rd row | 1993-08-08 |
| 4th row | 1993-08-08 |
| 5th row | 1993-08-08 |
Common Values
| Value | Count | Frequency (%) |
| 2012-02-29 | 66 | 0.3% |
| 2016-03-29 | 59 | 0.2% |
| 2008-03-26 | 59 | 0.2% |
| 2014-03-05 | 57 | 0.2% |
| 2022-03-29 | 55 | 0.2% |
| 2012-11-14 | 55 | 0.2% |
| 2011-10-11 | 54 | 0.2% |
| 2011-11-11 | 54 | 0.2% |
| 2011-11-15 | 53 | 0.2% |
| 2011-09-02 | 52 | 0.2% |
| Other values (5540) | 23357 | 97.6% |
Words
| Value | Count | Frequency (%) |
| 2012-02-29 | 66 | 0.3% |
| 2008-03-26 | 59 | 0.2% |
| 2016-03-29 | 59 | 0.2% |
| 2014-03-05 | 57 | 0.2% |
| 2012-11-14 | 55 | 0.2% |
| 2022-03-29 | 55 | 0.2% |
| 2011-10-11 | 54 | 0.2% |
| 2011-11-11 | 54 | 0.2% |
| 2011-11-15 | 53 | 0.2% |
| 2011-09-02 | 52 | 0.2% |
| Other values (5540) | 23357 | 97.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 60867 | 25.4% |
| - | 47842 | 20.0% |
| 1 | 38674 | 16.2% |
| 2 | 35093 | 14.7% |
| 9 | 15929 | 6.7% |
| 6 | 9098 | 3.8% |
| 3 | 7542 | 3.2% |
| 7 | 6383 | 2.7% |
| 8 | 6307 | 2.6% |
| 5 | 5981 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 191368 | 80.0% |
| Dash Punctuation | 47842 | 20.0% |
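The category names in this table correspond to Unicode general categories: digits fall under `Nd` (Decimal Number) and the hyphen under `Pd` (Dash Punctuation). The sketch below reproduces that classification with Python's standard `unicodedata` module. It is an illustration of the idea, not the code the profiler runs, and `category_counts` is a made-up helper name.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two dates in the same YYYY-MM-DD format as the column.
counts = category_counts(["1993-08-08", "2012-02-29"])
print(counts["Nd"], counts["Pd"])  # 16 4
```

Each date has 8 digits and 2 hyphens, which matches the report: 23921 × 8 = 191368 decimal numbers and 23921 × 2 = 47842 dashes.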
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 60867 | 31.8% |
| 1 | 38674 | 20.2% |
| 2 | 35093 | 18.3% |
| 9 | 15929 | 8.3% |
| 6 | 9098 | 4.8% |
| 3 | 7542 | 3.9% |
| 7 | 6383 | 3.3% |
| 8 | 6307 | 3.3% |
| 5 | 5981 | 3.1% |
| 4 | 5494 | 2.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47842 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 239210 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 60867 | 25.4% |
| - | 47842 | 20.0% |
| 1 | 38674 | 16.2% |
| 2 | 35093 | 14.7% |
| 9 | 15929 | 6.7% |
| 6 | 9098 | 3.8% |
| 3 | 7542 | 3.2% |
| 7 | 6383 | 2.7% |
| 8 | 6307 | 2.6% |
| 5 | 5981 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 239210 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 60867 | 25.4% |
| - | 47842 | 20.0% |
| 1 | 38674 | 16.2% |
| 2 | 35093 | 14.7% |
| 9 | 15929 | 6.7% |
| 6 | 9098 | 3.8% |
| 3 | 7542 | 3.2% |
| 7 | 6383 | 2.7% |
| 8 | 6307 | 2.6% |
| 5 | 5981 | 2.5% |
year (Numeric)
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2008.277998 |
| Minimum | 1993 |
|---|---|
| Maximum | 2022 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1993 |
|---|---|
| 5-th percentile | 1995 |
| Q1 | 2001 |
| median | 2008 |
| Q3 | 2015 |
| 95-th percentile | 2021 |
| Maximum | 2022 |
| Range | 29 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.029469422 |
|---|---|
| Coefficient of variation (CV) | 0.003998186222 |
| Kurtosis | -1.119182907 |
| Mean | 2008.277998 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.05465971 |
| Sum | 48040018 |
| Variance | 64.4723792 |
| Monotonicity | Increasing |
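Several rows of the descriptive statistics are derived from others: the variance is the squared standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the MAD is the median of absolute deviations from the median. A minimal sketch with the standard library follows. It assumes the sample (n − 1) standard deviation and reads "Increasing" as non-decreasing in row order; the helper names are illustrative.

```python
import statistics


def describe(values):
    """Mean, sample std, variance, CV and MAD for a list of numbers."""
    mean = statistics.mean(values)
    std = statistics.stdev(values)  # sample standard deviation (n - 1)
    median = statistics.median(values)
    mad = statistics.median([abs(v - median) for v in values])
    return {
        "mean": mean,
        "std": std,
        "variance": std ** 2,
        "cv": std / mean,
        "mad": mad,
    }


def is_increasing(values):
    """True when no value is smaller than the one before it."""
    return all(a <= b for a, b in zip(values, values[1:]))


# Illustrative sample of years in row order.
stats = describe([1993, 1993, 1994, 1996, 1999])
print(stats["mad"])                                    # 1
print(is_increasing([1993, 1993, 1994, 1996, 1999]))   # True

# Check against the report: CV = standard deviation / mean.
print(round(8.029469422 / 2008.277998, 6))             # 0.003998
```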
| Value | Count | Frequency (%) |
| 2021 | 1077 | 4.5% |
| 2019 | 1075 | 4.5% |
| 2008 | 1034 | 4.3% |
| 2011 | 1022 | 4.3% |
| 2004 | 1016 | 4.2% |
| 2000 | 993 | 4.2% |
| 2001 | 953 | 4.0% |
| 2013 | 942 | 3.9% |
| 2015 | 938 | 3.9% |
| 2012 | 927 | 3.9% |
| Other values (20) | 13944 | 58.3% |
| Value | Count | Frequency (%) |
| 1993 | 171 | 0.7% |
| 1994 | 494 | 2.1% |
| 1995 | 564 | 2.4% |
| 1996 | 781 | 3.3% |
| 1997 | 797 | 3.3% |
| 1998 | 636 | 2.7% |
| 1999 | 670 | 2.8% |
| 2000 | 993 | 4.2% |
| 2001 | 953 | 4.0% |
| 2002 | 701 | 2.9% |
| Value | Count | Frequency (%) |
| 2022 | 571 | 2.4% |
| 2021 | 1077 | 4.5% |
| 2020 | 298 | 1.2% |
| 2019 | 1075 | 4.5% |
| 2018 | 830 | 3.5% |
| 2017 | 885 | 3.7% |
| 2016 | 867 | 3.6% |
| 2015 | 938 | 3.9% |
| 2014 | 788 | 3.3% |
| 2013 | 942 | 3.9% |
month (Numeric)
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.856694954 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.268226166 |
|---|---|
| Coefficient of variation (CV) | 0.4766474501 |
| Kurtosis | -1.146959789 |
| Mean | 6.856694954 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.2201633437 |
| Sum | 164019 |
| Variance | 10.68130227 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 4048 | 16.9% |
| 10 | 3029 | 12.7% |
| 9 | 2852 | 11.9% |
| 11 | 2712 | 11.3% |
| 3 | 2577 | 10.8% |
| 1 | 1511 | 6.3% |
| 2 | 1366 | 5.7% |
| 8 | 1351 | 5.6% |
| 7 | 1275 | 5.3% |
| 5 | 1274 | 5.3% |
| Other values (2) | 1926 | 8.1% |
| Value | Count | Frequency (%) |
| 1 | 1511 | 6.3% |
| 2 | 1366 | 5.7% |
| 3 | 2577 | 10.8% |
| 4 | 906 | 3.8% |
| 5 | 1274 | 5.3% |
| 6 | 4048 | 16.9% |
| 7 | 1275 | 5.3% |
| 8 | 1351 | 5.6% |
| 9 | 2852 | 11.9% |
| 10 | 3029 | 12.7% |
| Value | Count | Frequency (%) |
| 12 | 1020 | 4.3% |
| 11 | 2712 | 11.3% |
| 10 | 3029 | 12.7% |
| 9 | 2852 | 11.9% |
| 8 | 1351 | 5.6% |
| 7 | 1275 | 5.3% |
| 6 | 4048 | 16.9% |
| 5 | 1274 | 5.3% |
| 4 | 906 | 3.8% |
| 3 | 2577 | 10.8% |
day (Numeric)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.78169809 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 14 |
| Q3 | 22 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.566008765 |
|---|---|
| Coefficient of variation (CV) | 0.579500996 |
| Kurtosis | -1.089846332 |
| Mean | 14.78169809 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.2675643283 |
| Sum | 353593 |
| Variance | 73.37650616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 1273 | 5.3% |
| 7 | 1096 | 4.6% |
| 6 | 1095 | 4.6% |
| 8 | 1065 | 4.5% |
| 10 | 1023 | 4.3% |
| 15 | 964 | 4.0% |
| 12 | 933 | 3.9% |
| 9 | 919 | 3.8% |
| 14 | 915 | 3.8% |
| 5 | 866 | 3.6% |
| Other values (21) | 13772 | 57.6% |
| Value | Count | Frequency (%) |
| 1 | 595 | 2.5% |
| 2 | 753 | 3.1% |
| 3 | 716 | 3.0% |
| 4 | 730 | 3.1% |
| 5 | 866 | 3.6% |
| 6 | 1095 | 4.6% |
| 7 | 1096 | 4.6% |
| 8 | 1065 | 4.5% |
| 9 | 919 | 3.8% |
| 10 | 1023 | 4.3% |
| Value | Count | Frequency (%) |
| 31 | 426 | 1.8% |
| 30 | 569 | 2.4% |
| 29 | 775 | 3.2% |
| 28 | 748 | 3.1% |
| 27 | 675 | 2.8% |
| 26 | 725 | 3.0% |
| 25 | 634 | 2.7% |
| 24 | 645 | 2.7% |
| 23 | 527 | 2.2% |
| 22 | 631 | 2.6% |
home_team (Categorical)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| mexico | 316 |
|---|---|
| usa | 314 |
| japan | 280 |
| saudi arabia | 272 |
| korea republic | 249 |
| Other values (206) | 22490 |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 8.101793403 |
| Min length | 3 |
Characters and Unicode
| Total characters | 193803 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | bolivia |
|---|---|
| 2nd row | brazil |
| 3rd row | ecuador |
| 4th row | guinea |
| 5th row | paraguay |
Common Values
| Value | Count | Frequency (%) |
| mexico | 316 | 1.3% |
| usa | 314 | 1.3% |
| japan | 280 | 1.2% |
| saudi arabia | 272 | 1.1% |
| korea republic | 249 | 1.0% |
| qatar | 249 | 1.0% |
| oman | 241 | 1.0% |
| united arab emirates | 239 | 1.0% |
| brazil | 233 | 1.0% |
| south africa | 229 | 1.0% |
| Other values (201) | 21299 | 89.0% |
Words
| Value | Count | Frequency (%) |
| republic | 714 | 2.4% |
| and | 500 | 1.7% |
| korea | 323 | 1.1% |
| mexico | 316 | 1.1% |
| usa | 314 | 1.1% |
| ireland | 291 | 1.0% |
| japan | 280 | 0.9% |
| arabia | 272 | 0.9% |
| saudi | 272 | 0.9% |
| islands | 267 | 0.9% |
| Other values (236) | 25994 | 88.0% |
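The table above counts individual words rather than whole team names, which is why "republic" and "and" rank at the top and why the counts sum to more than the 23921 observations. A minimal sketch of such a word count, assuming names are split on whitespace (the exact tokenisation used by the profiler is not shown in the report, and the sample list is illustrative):

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated words across all strings."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts


# Illustrative sample; "korea dpr" is an assumed name, not quoted from the report.
teams = ["korea republic", "korea dpr", "saudi arabia", "usa"]
counts = word_counts(teams)
print(counts["korea"], counts["republic"], counts["usa"])  # 2 1 1
```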
Most occurring characters
| Value | Count | Frequency (%) |
| a | 32405 | |
| i | 17882 | 9.2% |
| n | 16105 | 8.3% |
| e | 14189 | 7.3% |
| r | 13525 | 7.0% |
| o | 10410 | 5.4% |
| l | 8746 | 4.5% |
| s | 8589 | 4.4% |
| t | 8084 | 4.2% |
| u | 7826 | 4.0% |
| Other values (25) | 56042 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 187811 | |
| Space Separator | 5622 | 2.9% |
| Other Punctuation | 313 | 0.2% |
| Dash Punctuation | 57 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 32405 | |
| i | 17882 | 9.5% |
| n | 16105 | 8.6% |
| e | 14189 | 7.6% |
| r | 13525 | 7.2% |
| o | 10410 | 5.5% |
| l | 8746 | 4.7% |
| s | 8589 | 4.6% |
| t | 8084 | 4.3% |
| u | 7826 | 4.2% |
| Other values (21) | 50050 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 159 | 50.8% |
| ' | 154 | 49.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5622 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 187811 | |
| Common | 5992 | 3.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 32405 | |
| i | 17882 | 9.5% |
| n | 16105 | 8.6% |
| e | 14189 | 7.6% |
| r | 13525 | 7.2% |
| o | 10410 | 5.5% |
| l | 8746 | 4.7% |
| s | 8589 | 4.6% |
| t | 8084 | 4.3% |
| u | 7826 | 4.2% |
| Other values (21) | 50050 |
Common
| Value | Count | Frequency (%) |
| (space) | 5622 | 93.8% |
| . | 159 | 2.7% |
| ' | 154 | 2.6% |
| - | 57 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 193565 | |
| None | 238 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 32405 | |
| i | 17882 | 9.2% |
| n | 16105 | 8.3% |
| e | 14189 | 7.3% |
| r | 13525 | 7.0% |
| o | 10410 | 5.4% |
| l | 8746 | 4.5% |
| s | 8589 | 4.4% |
| t | 8084 | 4.2% |
| u | 7826 | 4.0% |
| Other values (20) | 55804 |
None
| Value | Count | Frequency (%) |
| ô | 154 | |
| ç | 33 | 13.9% |
| ã | 17 | 7.1% |
| é | 17 | 7.1% |
| í | 17 | 7.1% |
away_team (Categorical)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| zambia | 243 |
|---|---|
| costa rica | 217 |
| paraguay | 216 |
| sweden | 206 |
| mexico | 201 |
| Other values (206) | 22838 |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 8.126959575 |
| Min length | 3 |
Characters and Unicode
| Total characters | 194405 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | uruguay |
|---|---|
| 2nd row | mexico |
| 3rd row | venezuela |
| 4th row | sierra leone |
| 5th row | argentina |
Common Values
| Value | Count | Frequency (%) |
| zambia | 243 | 1.0% |
| costa rica | 217 | 0.9% |
| paraguay | 216 | 0.9% |
| sweden | 206 | 0.9% |
| mexico | 201 | 0.8% |
| brazil | 200 | 0.8% |
| jamaica | 199 | 0.8% |
| saudi arabia | 199 | 0.8% |
| iraq | 199 | 0.8% |
| ghana | 198 | 0.8% |
| Other values (201) | 21843 | 91.3% |
Words
| Value | Count | Frequency (%) |
| republic | 641 | 2.2% |
| and | 511 | 1.7% |
| korea | 331 | 1.1% |
| islands | 284 | 1.0% |
| ireland | 252 | 0.9% |
| zambia | 243 | 0.8% |
| guinea | 241 | 0.8% |
| congo | 229 | 0.8% |
| costa | 217 | 0.7% |
| rica | 217 | 0.7% |
| Other values (236) | 26234 | 89.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 32193 | |
| i | 18158 | 9.3% |
| n | 16114 | 8.3% |
| e | 14573 | 7.5% |
| r | 13263 | 6.8% |
| o | 10388 | 5.3% |
| l | 8829 | 4.5% |
| s | 8644 | 4.4% |
| u | 8051 | 4.1% |
| t | 7841 | 4.0% |
| Other values (25) | 56351 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 188479 | |
| Space Separator | 5479 | 2.8% |
| Other Punctuation | 354 | 0.2% |
| Dash Punctuation | 93 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 32193 | |
| i | 18158 | 9.6% |
| n | 16114 | 8.5% |
| e | 14573 | 7.7% |
| r | 13263 | 7.0% |
| o | 10388 | 5.5% |
| l | 8829 | 4.7% |
| s | 8644 | 4.6% |
| u | 8051 | 4.3% |
| t | 7841 | 4.2% |
| Other values (21) | 50425 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 182 | 51.4% |
| . | 172 | 48.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5479 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 93 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 188479 | |
| Common | 5926 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 32193 | |
| i | 18158 | 9.6% |
| n | 16114 | 8.5% |
| e | 14573 | 7.7% |
| r | 13263 | 7.0% |
| o | 10388 | 5.5% |
| l | 8829 | 4.7% |
| s | 8644 | 4.6% |
| u | 8051 | 4.3% |
| t | 7841 | 4.2% |
| Other values (21) | 50425 |
Common
| Value | Count | Frequency (%) |
| (space) | 5479 | 92.5% |
| ' | 182 | 3.1% |
| . | 172 | 2.9% |
| - | 93 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 194113 | |
| None | 292 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 32193 | |
| i | 18158 | 9.4% |
| n | 16114 | 8.3% |
| e | 14573 | 7.5% |
| r | 13263 | 6.8% |
| o | 10388 | 5.4% |
| l | 8829 | 4.5% |
| s | 8644 | 4.5% |
| u | 8051 | 4.1% |
| t | 7841 | 4.0% |
| Other values (20) | 56059 |
None
| Value | Count | Frequency (%) |
| ô | 182 | |
| ç | 35 | 12.0% |
| ã | 25 | 8.6% |
| é | 25 | 8.6% |
| í | 25 | 8.6% |
home_team_continent (Categorical)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| europe | 7593 |
|---|---|
| africa | 5885 |
| asia | 5302 |
| north america | 2772 |
| south america | 1839 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.92818026 |
| Min length | 4 |
Characters and Unicode
| Total characters | 165729 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | south america |
|---|---|
| 2nd row | south america |
| 3rd row | south america |
| 4th row | africa |
| 5th row | south america |
Common Values
| Value | Count | Frequency (%) |
| europe | 7593 | 31.7% |
| africa | 5885 | 24.6% |
| asia | 5302 | 22.2% |
| north america | 2772 | 11.6% |
| south america | 1839 | 7.7% |
| oceania | 530 | 2.2% |
Words
| Value | Count | Frequency (%) |
| europe | 7593 | 26.6% |
| africa | 5885 | 20.6% |
| asia | 5302 | 18.6% |
| america | 4611 | 16.2% |
| north | 2772 | 9.7% |
| south | 1839 | 6.4% |
| oceania | 530 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 32656 | |
| r | 20861 | |
| e | 20327 | |
| i | 16328 | |
| o | 12734 | 7.7% |
| c | 11026 | 6.7% |
| u | 9432 | 5.7% |
| p | 7593 | 4.6% |
| s | 7141 | 4.3% |
| f | 5885 | 3.6% |
| Other values (5) | 21746 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161118 | |
| Space Separator | 4611 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 32656 | |
| r | 20861 | |
| e | 20327 | |
| i | 16328 | |
| o | 12734 | 7.9% |
| c | 11026 | 6.8% |
| u | 9432 | 5.9% |
| p | 7593 | 4.7% |
| s | 7141 | 4.4% |
| f | 5885 | 3.7% |
| Other values (4) | 17135 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4611 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 161118 | |
| Common | 4611 | 2.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 32656 | |
| r | 20861 | |
| e | 20327 | |
| i | 16328 | |
| o | 12734 | 7.9% |
| c | 11026 | 6.8% |
| u | 9432 | 5.9% |
| p | 7593 | 4.7% |
| s | 7141 | 4.4% |
| f | 5885 | 3.7% |
| Other values (4) | 17135 |
Common
| Value | Count | Frequency (%) |
| (space) | 4611 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 165729 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 32656 | |
| r | 20861 | |
| e | 20327 | |
| i | 16328 | |
| o | 12734 | 7.7% |
| c | 11026 | 6.7% |
| u | 9432 | 5.7% |
| p | 7593 | 4.6% |
| s | 7141 | 4.3% |
| f | 5885 | 3.6% |
| Other values (5) | 21746 |
away_team_continent (Categorical)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| europe | 7359 |
|---|---|
| africa | 6306 |
| asia | 4817 |
| north america | 2703 |
| south america | 2161 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 7.044646963 |
| Min length | 4 |
Characters and Unicode
| Total characters | 168515 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | south america |
|---|---|
| 2nd row | north america |
| 3rd row | south america |
| 4th row | africa |
| 5th row | south america |
Common Values
| Value | Count | Frequency (%) |
| europe | 7359 | 30.8% |
| africa | 6306 | 26.4% |
| asia | 4817 | 20.1% |
| north america | 2703 | 11.3% |
| south america | 2161 | 9.0% |
| oceania | 575 | 2.4% |
Words
| Value | Count | Frequency (%) |
| europe | 7359 | 25.6% |
| africa | 6306 | 21.9% |
| america | 4864 | 16.9% |
| asia | 4817 | 16.7% |
| north | 2703 | 9.4% |
| south | 2161 | 7.5% |
| oceania | 575 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 33124 | |
| r | 21232 | |
| e | 20157 | |
| i | 16562 | |
| o | 12798 | 7.6% |
| c | 11745 | 7.0% |
| u | 9520 | 5.6% |
| p | 7359 | 4.4% |
| s | 6978 | 4.1% |
| f | 6306 | 3.7% |
| Other values (5) | 22734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 163651 | |
| Space Separator | 4864 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 33124 | |
| r | 21232 | |
| e | 20157 | |
| i | 16562 | |
| o | 12798 | 7.8% |
| c | 11745 | 7.2% |
| u | 9520 | 5.8% |
| p | 7359 | 4.5% |
| s | 6978 | 4.3% |
| f | 6306 | 3.9% |
| Other values (4) | 17870 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4864 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 163651 | |
| Common | 4864 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 33124 | |
| r | 21232 | |
| e | 20157 | |
| i | 16562 | |
| o | 12798 | 7.8% |
| c | 11745 | 7.2% |
| u | 9520 | 5.8% |
| p | 7359 | 4.5% |
| s | 6978 | 4.3% |
| f | 6306 | 3.9% |
| Other values (4) | 17870 |
Common
| Value | Count | Frequency (%) |
| (space) | 4864 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 168515 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 33124 | |
| r | 21232 | |
| e | 20157 | |
| i | 16562 | |
| o | 12798 | 7.6% |
| c | 11745 | 7.0% |
| u | 9520 | 5.6% |
| p | 7359 | 4.4% |
| s | 6978 | 4.1% |
| f | 6306 | 3.7% |
| Other values (5) | 22734 |
home_team_fifa_rank (Numeric)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.85468835 |
| Minimum | 1 |
|---|---|
| Maximum | 211 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 33 |
| median | 71 |
| Q3 | 115 |
| 95-th percentile | 174 |
| Maximum | 211 |
| Range | 210 |
| Interquartile range (IQR) | 82 |
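The last two rows of the quantile table are derived from the others: the range is maximum minus minimum (211 − 1 = 210) and the IQR is Q3 minus Q1 (115 − 33 = 82). A minimal sketch using the standard library follows. It assumes linear interpolation between order statistics (`method="inclusive"`), which may differ from the profiler's quantile method, and `quantile_summary` is an illustrative name.

```python
import statistics


def quantile_summary(values):
    """Min, quartiles, max, range and IQR for a list of numbers."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    low, high = min(values), max(values)
    return {
        "min": low,
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": high,
        "range": high - low,
        "iqr": q3 - q1,
    }


# Illustrative ranks 1..9, not taken from the dataset.
summary = quantile_summary(list(range(1, 10)))
print(summary["q1"], summary["median"], summary["q3"])  # 3.0 5.0 7.0
print(summary["range"], summary["iqr"])                 # 8 4.0
```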
Descriptive statistics
| Standard deviation | 52.35522517 |
|---|---|
| Coefficient of variation (CV) | 0.6724736337 |
| Kurtosis | -0.7546146622 |
| Mean | 77.85468835 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 0.4514227146 |
| Sum | 1862362 |
| Variance | 2741.069603 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22 | 214 | 0.9% |
| 3 | 209 | 0.9% |
| 29 | 203 | 0.8% |
| 5 | 198 | 0.8% |
| 11 | 198 | 0.8% |
| 10 | 198 | 0.8% |
| 1 | 198 | 0.8% |
| 4 | 197 | 0.8% |
| 34 | 197 | 0.8% |
| 12 | 197 | 0.8% |
| Other values (201) | 21912 | 91.6% |
| Value | Count | Frequency (%) |
| 1 | 198 | 0.8% |
| 2 | 187 | 0.8% |
| 3 | 209 | 0.9% |
| 4 | 197 | 0.8% |
| 5 | 198 | 0.8% |
| 6 | 183 | 0.8% |
| 7 | 178 | 0.7% |
| 8 | 196 | 0.8% |
| 9 | 177 | 0.7% |
| 10 | 198 | 0.8% |
| Value | Count | Frequency (%) |
| 211 | 6 | < 0.1% |
| 210 | 12 | 0.1% |
| 209 | 11 | < 0.1% |
| 208 | 8 | < 0.1% |
| 207 | 11 | < 0.1% |
| 206 | 15 | 0.1% |
| 205 | 14 | 0.1% |
| 204 | 21 | 0.1% |
| 203 | 43 | 0.2% |
| 202 | 23 | 0.1% |
away_team_fifa_rank (Numeric)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80.79737469 |
| Minimum | 1 |
|---|---|
| Maximum | 211 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 36 |
| median | 73 |
| Q3 | 119 |
| 95-th percentile | 179 |
| Maximum | 211 |
| Range | 210 |
| Interquartile range (IQR) | 83 |
Descriptive statistics
| Standard deviation | 53.23290188 |
|---|---|
| Coefficient of variation (CV) | 0.6588444499 |
| Kurtosis | -0.7663150753 |
| Mean | 80.79737469 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 0.4438615419 |
| Sum | 1932754 |
| Variance | 2833.741843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 195 | 0.8% |
| 29 | 189 | 0.8% |
| 14 | 188 | 0.8% |
| 38 | 184 | 0.8% |
| 18 | 183 | 0.8% |
| 4 | 182 | 0.8% |
| 55 | 182 | 0.8% |
| 36 | 182 | 0.8% |
| 37 | 182 | 0.8% |
| 27 | 180 | 0.8% |
| Other values (201) | 22074 | 92.3% |
| Value | Count | Frequency (%) |
| 1 | 195 | 0.8% |
| 2 | 180 | 0.8% |
| 3 | 151 | 0.6% |
| 4 | 182 | 0.8% |
| 5 | 174 | 0.7% |
| 6 | 168 | 0.7% |
| 7 | 175 | 0.7% |
| 8 | 157 | 0.7% |
| 9 | 136 | 0.6% |
| 10 | 157 | 0.7% |
| Value | Count | Frequency (%) |
| 211 | 5 | < 0.1% |
| 210 | 13 | 0.1% |
| 209 | 12 | 0.1% |
| 208 | 8 | < 0.1% |
| 207 | 17 | 0.1% |
| 206 | 19 | 0.1% |
| 205 | 19 | 0.1% |
| 204 | 19 | 0.1% |
| 203 | 39 | 0.2% |
| 202 | 48 | 0.2% |
home_team_total_fifa_points (Numeric)
| Distinct | 1686 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 323.4014882 |
| Minimum | 0 |
|---|---|
| Maximum | 2164 |
| Zeros | 14290 |
| Zeros (%) | 59.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 547 |
| 95-th percentile | 1439 |
| Maximum | 2164 |
| Range | 2164 |
| Interquartile range (IQR) | 547 |
Descriptive statistics
| Standard deviation | 500.8257245 |
|---|---|
| Coefficient of variation (CV) | 1.548619109 |
| Kurtosis | 0.4078698977 |
| Mean | 323.4014882 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.348462631 |
| Sum | 7736087 |
| Variance | 250826.4064 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 14290 | 59.7% |
| 260 | 27 | 0.1% |
| 1174 | 22 | 0.1% |
| 369 | 20 | 0.1% |
| 340 | 19 | 0.1% |
| 924 | 19 | 0.1% |
| 228 | 19 | 0.1% |
| 323 | 19 | 0.1% |
| 389 | 18 | 0.1% |
| 374 | 18 | 0.1% |
| Other values (1676) | 9450 | 39.5% |
| Value | Count | Frequency (%) |
| 0 | 14290 | 59.7% |
| 1 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 6 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
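| 2164 | 2 | < 0.1% |
| 2160 | 2 | < 0.1% |
| 2124 | 2 | < 0.1% |
| 2036 | 2 | < 0.1% |
| 2017 | 1 | < 0.1% |
| 1998 | 1 | < 0.1% |
| 1955 | 2 | < 0.1% |
| 1832 | 2 | < 0.1% |
| 1828 | 1 | < 0.1% |
| 1827 | 2 | < 0.1% |
away_team_total_fifa_points (Numeric)
| Distinct | 1679 |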
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 315.4535764 |
| Minimum | 0 |
|---|---|
| Maximum | 2164 |
| Zeros | 14288 |
| Zeros (%) | 59.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 523 |
| 95-th percentile | 1416 |
| Maximum | 2164 |
| Range | 2164 |
| Interquartile range (IQR) | 523 |
Descriptive statistics
| Standard deviation | 490.9442731 |
|---|---|
| Coefficient of variation (CV) | 1.556312275 |
| Kurtosis | 0.4746270007 |
| Mean | 315.4535764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.366652059 |
| Sum | 7545965 |
| Variance | 241026.2793 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 14288 | 59.7% |
| 551 | 18 | 0.1% |
| 374 | 18 | 0.1% |
| 298 | 18 | 0.1% |
| 316 | 17 | 0.1% |
| 255 | 17 | 0.1% |
| 332 | 17 | 0.1% |
| 329 | 17 | 0.1% |
| 1378 | 17 | 0.1% |
| 338 | 17 | 0.1% |
| Other values (1669) | 9477 | 39.6% |
| Value | Count | Frequency (%) |
| 0 | 14288 | 59.7% |
| 1 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 11 | < 0.1% |
| 5 | 12 | 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2164 | 1 | < 0.1% |
| 2124 | 2 | < 0.1% |
| 2104 | 1 | < 0.1% |
| 2099 | 4 | < 0.1% |
| 2087 | 1 | < 0.1% |
| 2041 | 1 | < 0.1% |
| 2036 | 2 | < 0.1% |
| 2014 | 1 | < 0.1% |
| 1832 | 4 | < 0.1% |
| 1828 | 1 | < 0.1% |
home_team_score (Numeric)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.609213662 |
| Minimum | 0 |
|---|---|
| Maximum | 31 |
| Zeros | 6273 |
| Zeros (%) | 26.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 31 |
| Range | 31 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.630126712 |
|---|---|
| Coefficient of variation (CV) | 1.01299582 |
| Kurtosis | 12.8072405 |
| Mean | 1.609213662 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.210624947 |
| Sum | 38494 |
| Variance | 2.657313098 |
| Monotonicity | Not monotonic |
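The skewness (about 2.21) and kurtosis (about 12.8) reported above describe a long right tail: most matches have few goals and a handful have very many. The sketch below computes both statistics with the bias-adjusted sample formulas. That the report uses exactly these estimators is an assumption, and the function names and the toy goal list are illustrative.

```python
import math


def _mean_and_std(values):
    """Mean and sample standard deviation (n - 1 in the denominator)."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    return mean, std


def sample_skewness(values):
    """Bias-adjusted sample skewness; needs at least 3 values."""
    n = len(values)
    mean, std = _mean_and_std(values)
    cubes = sum(((v - mean) / std) ** 3 for v in values)
    return n / ((n - 1) * (n - 2)) * cubes


def sample_excess_kurtosis(values):
    """Bias-adjusted sample excess kurtosis; needs at least 4 values."""
    n = len(values)
    mean, std = _mean_and_std(values)
    fourths = sum(((v - mean) / std) ** 4 for v in values)
    scale = n * (n + 1) / ((n - 1) * (n - 2) * (n - 3))
    correction = 3 * (n - 1) ** 2 / ((n - 2) * (n - 3))
    return scale * fourths - correction


# Toy goal counts with one very high score, not taken from the dataset.
goals = [0, 0, 1, 1, 1, 2, 2, 3, 7]
print(sample_skewness(goals) > 0)         # True: right-skewed
print(sample_excess_kurtosis(goals) > 0)  # True: heavier tail than a normal distribution
```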
| Value | Count | Frequency (%) |
| 1 | 7229 | 30.2% |
| 0 | 6273 | 26.2% |
| 2 | 5263 | 22.0% |
| 3 | 2613 | 10.9% |
| 4 | 1330 | 5.6% |
| 5 | 583 | 2.4% |
| 6 | 292 | 1.2% |
| 7 | 142 | 0.6% |
| 8 | 84 | 0.4% |
| 9 | 43 | 0.2% |
| Other values (11) | 69 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 6273 | 26.2% |
| 1 | 7229 | 30.2% |
| 2 | 5263 | 22.0% |
| 3 | 2613 | 10.9% |
| 4 | 1330 | 5.6% |
| 5 | 583 | 2.4% |
| 6 | 292 | 1.2% |
| 7 | 142 | 0.6% |
| 8 | 84 | 0.4% |
| 9 | 43 | 0.2% |
| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 3 | < 0.1% |
| 15 | 3 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 6 | < 0.1% |
| 12 | 9 | < 0.1% |
| 11 | 12 | 0.1% |
away_team_score (Numeric)
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.068266377 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 9558 |
| Zeros (%) | 40.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.263944313 |
|---|---|
| Coefficient of variation (CV) | 1.183173355 |
| Kurtosis | 10.92452874 |
| Mean | 1.068266377 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.187963747 |
| Sum | 25554 |
| Variance | 1.597555226 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9558 | 40.0% |
| 1 | 7759 | 32.4% |
| 2 | 4013 | 16.8% |
| 3 | 1551 | 6.5% |
| 4 | 594 | 2.5% |
| 5 | 214 | 0.9% |
| 6 | 107 | 0.4% |
| 7 | 67 | 0.3% |
| 8 | 27 | 0.1% |
| 10 | 11 | < 0.1% |
| Other values (8) | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 9558 | 40.0% |
| 1 | 7759 | 32.4% |
| 2 | 4013 | 16.8% |
| 3 | 1551 | 6.5% |
| 4 | 594 | 2.5% |
| 5 | 214 | 0.9% |
| 6 | 107 | 0.4% |
| 7 | 67 | 0.3% |
| 8 | 27 | 0.1% |
| 9 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 11 | < 0.1% |
| 9 | 9 | < 0.1% |
| 8 | 27 | 0.1% |
| Distinct | 82 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| friendly | 8558 |
|---|---|
| fifa world cup qualification | 5528 |
| uefa euro qualification | 1723 |
| african cup of nations qualification | 1274 |
| afc asian cup qualification | 541 |
| Other values (77) | 6297 |
Length
| Max length | 42 |
|---|---|
| Median length | 37 |
| Mean length | 17.90719452 |
| Min length | 7 |
Characters and Unicode
| Total characters | 428358 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | fifa world cup qualification |
|---|---|
| 2nd row | friendly |
| 3rd row | fifa world cup qualification |
| 4th row | friendly |
| 5th row | fifa world cup qualification |
Common Values
| Value | Count | Frequency (%) |
| friendly | 8558 | 35.8% |
| fifa world cup qualification | 5528 | 23.1% |
| uefa euro qualification | 1723 | 7.2% |
| african cup of nations qualification | 1274 | 5.3% |
| afc asian cup qualification | 541 | 2.3% |
| african cup of nations | 490 | 2.0% |
| fifa world cup | 432 | 1.8% |
| uefa nations league | 415 | 1.7% |
| cosafa cup | 309 | 1.3% |
| cecafa cup | 308 | 1.3% |
| Other values (72) | 4343 | 18.2% |
Length
| Value | Count | Frequency (%) |
| cup | 11350 | 18.5% |
| qualification | 9647 | 15.7% |
| friendly | 8558 | 14.0% |
| world | 5960 | 9.7% |
| fifa | 5960 | 9.7% |
| nations | 2763 | 4.5% |
| uefa | 2391 | 3.9% |
| african | 2053 | 3.4% |
| euro | 1976 | 3.2% |
| of | 1767 | 2.9% |
| Other values (89) | 8833 | 14.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 52660 | 12.3% |
| a | 44710 | 10.4% |
| f | 40430 | 9.4% |
| (space) | 37337 | 8.7% |
| n | 30043 | 7.0% |
| c | 28733 | 6.7% |
| u | 27195 | 6.3% |
| l | 26217 | 6.1% |
| o | 24922 | 5.8% |
| r | 20435 | 4.8% |
| Other values (22) | 95676 | 22.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 390918 | 91.3% |
| Space Separator | 37337 | 8.7% |
| Other Punctuation | 98 | < 0.1% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 52660 | 13.5% |
| a | 44710 | 11.4% |
| f | 40430 | 10.3% |
| n | 30043 | 7.7% |
| c | 28733 | 7.4% |
| u | 27195 | 7.0% |
| l | 26217 | 6.7% |
| o | 24922 | 6.4% |
| r | 20435 | 5.2% |
| e | 16558 | 4.2% |
| Other values (18) | 79015 | 20.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 | 80.0% |
| – | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37337 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 98 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 390918 | 91.3% |
| Common | 37440 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 52660 | 13.5% |
| a | 44710 | 11.4% |
| f | 40430 | 10.3% |
| n | 30043 | 7.7% |
| c | 28733 | 7.4% |
| u | 27195 | 7.0% |
| l | 26217 | 6.7% |
| o | 24922 | 6.4% |
| r | 20435 | 5.2% |
| e | 16558 | 4.2% |
| Other values (18) | 79015 | 20.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 37337 | 99.7% |
| ' | 98 | 0.3% |
| - | 4 | < 0.1% |
| – | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 427968 | 99.9% |
| None | 389 | 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 52660 | 12.3% |
| a | 44710 | 10.4% |
| f | 40430 | 9.4% |
| (space) | 37337 | 8.7% |
| n | 30043 | 7.0% |
| c | 28733 | 6.7% |
| u | 27195 | 6.4% |
| l | 26217 | 6.1% |
| o | 24922 | 5.8% |
| r | 20435 | 4.8% |
| Other values (18) | 95286 | 22.3% |
None
| Value | Count | Frequency (%) |
| é | 304 | 78.1% |
| í | 77 | 19.8% |
| á | 8 | 2.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 | 100.0% |
| Distinct | 1576 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| doha | 397 |
|---|---|
| bangkok | 215 |
| muscat | 212 |
| kuwait city | 202 |
| abu dhabi | 191 |
| Other values (1571) | 22704 |
Length
| Max length | 28 |
|---|---|
| Median length | 24 |
| Mean length | 7.724133606 |
| Min length | 2 |
Characters and Unicode
| Total characters | 184769 |
|---|---|
| Distinct characters | 86 |
| Distinct categories | 7 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 474 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | la paz |
|---|---|
| 2nd row | maceió |
| 3rd row | quito |
| 4th row | conakry |
| 5th row | asunción |
Common Values
| Value | Count | Frequency (%) |
| doha | 397 | 1.7% |
| bangkok | 215 | 0.9% |
| muscat | 212 | 0.9% |
| kuwait city | 202 | 0.8% |
| abu dhabi | 191 | 0.8% |
| london | 185 | 0.8% |
| amman | 180 | 0.8% |
| cairo | 164 | 0.7% |
| dubai | 162 | 0.7% |
| tehran | 161 | 0.7% |
| Other values (1566) | 21852 | 91.4% |
Length
| Value | Count | Frequency (%) |
| city | 527 | 1.9% |
| san | 435 | 1.5% |
| doha | 397 | 1.4% |
| port | 241 | 0.8% |
| bangkok | 215 | 0.8% |
| muscat | 212 | 0.7% |
| kuwait | 203 | 0.7% |
| abu | 191 | 0.7% |
| dhabi | 191 | 0.7% |
| london | 187 | 0.7% |
| Other values (1715) | 25660 | 90.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 27914 | 15.1% |
| n | 13401 | 7.3% |
| o | 12960 | 7.0% |
| i | 12438 | 6.7% |
| e | 11669 | 6.3% |
| r | 11149 | 6.0% |
| s | 10219 | 5.5% |
| l | 9384 | 5.1% |
| t | 8861 | 4.8% |
| u | 7716 | 4.2% |
| Other values (76) | 59058 | 32.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 179266 | 97.0% |
| Space Separator | 4538 | 2.5% |
| Dash Punctuation | 530 | 0.3% |
| Other Punctuation | 417 | 0.2% |
| Nonspacing Mark | 7 | < 0.1% |
| Initial Punctuation | 6 | < 0.1% |
| Decimal Number | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 27914 | 15.6% |
| n | 13401 | 7.5% |
| o | 12960 | 7.2% |
| i | 12438 | 6.9% |
| e | 11669 | 6.5% |
| r | 11149 | 6.2% |
| s | 10219 | 5.7% |
| l | 9384 | 5.2% |
| t | 8861 | 4.9% |
| u | 7716 | 4.3% |
| Other values (68) | 53555 | 29.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 251 | 60.2% |
| . | 162 | 38.8% |
| / | 4 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4538 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 530 | 100.0% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̇ | 7 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 6 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 5 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 179266 | 97.0% |
| Common | 5496 | 3.0% |
| Inherited | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 27914 | 15.6% |
| n | 13401 | 7.5% |
| o | 12960 | 7.2% |
| i | 12438 | 6.9% |
| e | 11669 | 6.5% |
| r | 11149 | 6.2% |
| s | 10219 | 5.7% |
| l | 9384 | 5.2% |
| t | 8861 | 4.9% |
| u | 7716 | 4.3% |
| Other values (68) | 53555 | 29.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 4538 | 82.6% |
| - | 530 | 9.6% |
| ' | 251 | 4.6% |
| . | 162 | 2.9% |
| ‘ | 6 | 0.1% |
| 6 | 5 | 0.1% |
| / | 4 | 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ̇ | 7 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 182796 | 98.9% |
| None | 1947 | 1.1% |
| Diacriticals | 7 | < 0.1% |
| Latin Ext Additional | 7 | < 0.1% |
| Punctuation | 6 | < 0.1% |
| IPA Ext | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 27914 | 15.3% |
| n | 13401 | 7.3% |
| o | 12960 | 7.1% |
| i | 12438 | 6.8% |
| e | 11669 | 6.4% |
| r | 11149 | 6.1% |
| s | 10219 | 5.6% |
| l | 9384 | 5.1% |
| t | 8861 | 4.8% |
| u | 7716 | 4.2% |
| Other values (22) | 57085 | 31.2% |
None
| Value | Count | Frequency (%) |
| é | 549 | 28.2% |
| ó | 301 | 15.5% |
| í | 185 | 9.5% |
| è | 104 | 5.3% |
| ș | 97 | 5.0% |
| ă | 85 | 4.4% |
| á | 73 | 3.7% |
| ã | 63 | 3.2% |
| à | 59 | 3.0% |
| ò | 57 | 2.9% |
| Other values (36) | 374 | 19.2% |
Diacriticals
| Value | Count | Frequency (%) |
| ̇ | 7 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 6 | 100.0% |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 6 | 100.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 2 | 28.6% |
| ị | 2 | 28.6% |
| ủ | 1 | 14.3% |
| ộ | 1 | 14.3% |
| ầ | 1 | 14.3% |
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| usa | 1003 |
|---|---|
| south africa | 505 |
| united arab emirates | 462 |
| qatar | 461 |
| france | 445 |
| Other values (212) | 21045 |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 8.085238911 |
| Min length | 3 |
Characters and Unicode
| Total characters | 193407 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | bolivia |
|---|---|
| 2nd row | brazil |
| 3rd row | ecuador |
| 4th row | guinea |
| 5th row | paraguay |
Common Values
| Value | Count | Frequency (%) |
| usa | 1003 | 4.2% |
| south africa | 505 | 2.1% |
| united arab emirates | 462 | 1.9% |
| qatar | 461 | 1.9% |
| france | 445 | 1.9% |
| germany | 285 | 1.2% |
| saudi arabia | 280 | 1.2% |
| thailand | 280 | 1.2% |
| japan | 277 | 1.2% |
| england | 277 | 1.2% |
| Other values (207) | 19646 | 82.1% |
Length
| Value | Count | Frequency (%) |
| usa | 1003 | 3.4% |
| republic | 658 | 2.2% |
| south | 513 | 1.7% |
| and | 509 | 1.7% |
| africa | 505 | 1.7% |
| united | 462 | 1.5% |
| arab | 462 | 1.5% |
| emirates | 462 | 1.5% |
| qatar | 461 | 1.5% |
| france | 445 | 1.5% |
| Other values (244) | 24377 | 81.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 33081 | 17.1% |
| i | 17182 | 8.9% |
| n | 16016 | 8.3% |
| e | 13856 | 7.2% |
| r | 13736 | 7.1% |
| o | 9701 | 5.0% |
| s | 9381 | 4.9% |
| t | 8788 | 4.5% |
| u | 8750 | 4.5% |
| l | 8124 | 4.2% |
| Other values (25) | 54792 | 28.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 187201 | 96.8% |
| Space Separator | 5936 | 3.1% |
| Other Punctuation | 243 | 0.1% |
| Dash Punctuation | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 33081 | 17.7% |
| i | 17182 | 9.2% |
| n | 16016 | 8.6% |
| e | 13856 | 7.4% |
| r | 13736 | 7.3% |
| o | 9701 | 5.2% |
| s | 9381 | 5.0% |
| t | 8788 | 4.7% |
| u | 8750 | 4.7% |
| l | 8124 | 4.3% |
| Other values (21) | 48586 | 26.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 151 | 62.1% |
| ' | 92 | 37.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5936 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 187201 | 96.8% |
| Common | 6206 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 33081 | 17.7% |
| i | 17182 | 9.2% |
| n | 16016 | 8.6% |
| e | 13856 | 7.4% |
| r | 13736 | 7.3% |
| o | 9701 | 5.2% |
| s | 9381 | 5.0% |
| t | 8788 | 4.7% |
| u | 8750 | 4.7% |
| l | 8124 | 4.3% |
| Other values (21) | 48586 | 26.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 5936 | 95.6% |
| . | 151 | 2.4% |
| ' | 92 | 1.5% |
| - | 27 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 193237 | 99.9% |
| None | 170 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 33081 | 17.1% |
| i | 17182 | 8.9% |
| n | 16016 | 8.3% |
| e | 13856 | 7.2% |
| r | 13736 | 7.1% |
| o | 9701 | 5.0% |
| s | 9381 | 4.9% |
| t | 8788 | 4.5% |
| u | 8750 | 4.5% |
| l | 8124 | 4.2% |
| Other values (20) | 54622 | 28.3% |
None
| Value | Count | Frequency (%) |
| ô | 92 | 54.1% |
| ç | 31 | 18.2% |
| é | 17 | 10.0% |
| ã | 15 | 8.8% |
| í | 15 | 8.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.5 KiB |
| False | 17947 |
|---|---|
| True | 5974 |
| Value | Count | Frequency (%) |
| False | 17947 | 75.0% |
| True | 5974 | 25.0% |
shoot_out
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.5 KiB |
| False | 23589 |
|---|---|
| True | 332 |
| Value | Count | Frequency (%) |
| False | 23589 | 98.6% |
| True | 332 | 1.4% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| win | 11761 |
|---|---|
| lose | 6771 |
| draw | 5389 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.508339952 |
| Min length | 3 |
Characters and Unicode
| Total characters | 83923 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | win |
|---|---|
| 2nd row | draw |
| 3rd row | win |
| 4th row | win |
| 5th row | lose |
Common Values
| Value | Count | Frequency (%) |
| win | 11761 | 49.2% |
| lose | 6771 | 28.3% |
| draw | 5389 | 22.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| win | 11761 | 49.2% |
| lose | 6771 | 28.3% |
| draw | 5389 | 22.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 17150 | 20.4% |
| i | 11761 | 14.0% |
| n | 11761 | 14.0% |
| l | 6771 | 8.1% |
| o | 6771 | 8.1% |
| s | 6771 | 8.1% |
| e | 6771 | 8.1% |
| d | 5389 | 6.4% |
| r | 5389 | 6.4% |
| a | 5389 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 83923 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 17150 | 20.4% |
| i | 11761 | 14.0% |
| n | 11761 | 14.0% |
| l | 6771 | 8.1% |
| o | 6771 | 8.1% |
| s | 6771 | 8.1% |
| e | 6771 | 8.1% |
| d | 5389 | 6.4% |
| r | 5389 | 6.4% |
| a | 5389 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83923 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 17150 | 20.4% |
| i | 11761 | 14.0% |
| n | 11761 | 14.0% |
| l | 6771 | 8.1% |
| o | 6771 | 8.1% |
| s | 6771 | 8.1% |
| e | 6771 | 8.1% |
| d | 5389 | 6.4% |
| r | 5389 | 6.4% |
| a | 5389 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83923 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| w | 17150 | 20.4% |
| i | 11761 | 14.0% |
| n | 11761 | 14.0% |
| l | 6771 | 8.1% |
| o | 6771 | 8.1% |
| s | 6771 | 8.1% |
| e | 6771 | 8.1% |
| d | 5389 | 6.4% |
| r | 5389 | 6.4% |
| a | 5389 | 6.4% |
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 15542 |
| Missing (%) | 65.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.96383817 |
| Minimum | 47 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 47 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 70 |
| median | 75 |
| Q3 | 81 |
| 95-th percentile | 88 |
| Maximum | 97 |
| Range | 50 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.212242192 |
|---|---|
| Coefficient of variation (CV) | 0.1095493826 |
| Kurtosis | 0.1345376971 |
| Mean | 74.96383817 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.2847013488 |
| Sum | 628122 |
| Variance | 67.44092181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 474 | 2.0% |
| 74 | 441 | 1.8% |
| 75 | 437 | 1.8% |
| 76 | 420 | 1.8% |
| 72 | 403 | 1.7% |
| 77 | 374 | 1.6% |
| 79 | 347 | 1.5% |
| 81 | 344 | 1.4% |
| 80 | 325 | 1.4% |
| 82 | 324 | 1.4% |
| Other values (40) | 4490 | 18.8% |
| (Missing) | 15542 | 65.0% |
| Value | Count | Frequency (%) |
| 47 | 6 | < 0.1% |
| 48 | 7 | < 0.1% |
| 49 | 9 | < 0.1% |
| 50 | 12 | 0.1% |
| 51 | 17 | 0.1% |
| 52 | 30 | 0.1% |
| 53 | 24 | 0.1% |
| 54 | 20 | 0.1% |
| 55 | 42 | 0.2% |
| 56 | 32 | 0.1% |
| Value | Count | Frequency (%) |
| 97 | 6 | < 0.1% |
| 95 | 19 | 0.1% |
| 94 | 26 | 0.1% |
| 93 | 31 | 0.1% |
| 92 | 22 | 0.1% |
| 91 | 51 | 0.2% |
| 90 | 113 | 0.5% |
| 89 | 120 | 0.5% |
| 88 | 111 | 0.5% |
| 87 | 145 | 0.6% |
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 15826 |
| Missing (%) | 66.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.21247684 |
| Minimum | 47 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 47 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 69 |
| median | 74 |
| Q3 | 80 |
| 95-th percentile | 87 |
| Maximum | 97 |
| Range | 50 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.225919096 |
|---|---|
| Coefficient of variation (CV) | 0.110842805 |
| Kurtosis | 0.1394057478 |
| Mean | 74.21247684 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.2838054963 |
| Sum | 600750 |
| Variance | 67.66574498 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 473 | 2.0% |
| 73 | 466 | 1.9% |
| 72 | 424 | 1.8% |
| 74 | 404 | 1.7% |
| 76 | 365 | 1.5% |
| 77 | 343 | 1.4% |
| 69 | 340 | 1.4% |
| 81 | 313 | 1.3% |
| 78 | 311 | 1.3% |
| 79 | 310 | 1.3% |
| Other values (40) | 4346 | 18.2% |
| (Missing) | 15826 | 66.2% |
| Value | Count | Frequency (%) |
| 47 | 6 | < 0.1% |
| 48 | 10 | < 0.1% |
| 49 | 10 | < 0.1% |
| 50 | 21 | 0.1% |
| 51 | 21 | 0.1% |
| 52 | 30 | 0.1% |
| 53 | 23 | 0.1% |
| 54 | 24 | 0.1% |
| 55 | 47 | 0.2% |
| 56 | 39 | 0.2% |
| Value | Count | Frequency (%) |
| 97 | 5 | < 0.1% |
| 95 | 12 | 0.1% |
| 94 | 14 | 0.1% |
| 93 | 24 | 0.1% |
| 92 | 16 | 0.1% |
| 91 | 43 | 0.2% |
| 90 | 87 | 0.4% |
| 89 | 91 | 0.4% |
| 88 | 97 | 0.4% |
| 87 | 122 | 0.5% |
| Distinct | 127 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 16134 |
| Missing (%) | 67.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.903249 |
| Minimum | 52.8 |
|---|---|
| Maximum | 91.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 52.8 |
|---|---|
| 5-th percentile | 65 |
| Q1 | 71 |
| median | 75.2 |
| Q3 | 78.8 |
| 95-th percentile | 85 |
| Maximum | 91.8 |
| Range | 39 |
| Interquartile range (IQR) | 7.8 |
Descriptive statistics
| Standard deviation | 6.003114482 |
|---|---|
| Coefficient of variation (CV) | 0.08014491443 |
| Kurtosis | 0.02149570954 |
| Mean | 74.903249 |
| Median Absolute Deviation (MAD) | 3.8 |
| Skewness | -0.1065522626 |
| Sum | 583271.6 |
| Variance | 36.03738348 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75.5 | 194 | 0.8% |
| 77 | 182 | 0.8% |
| 76 | 178 | 0.7% |
| 76.5 | 177 | 0.7% |
| 75.2 | 159 | 0.7% |
| 78.2 | 150 | 0.6% |
| 71.5 | 143 | 0.6% |
| 77.8 | 142 | 0.6% |
| 70.8 | 141 | 0.6% |
| 74.2 | 140 | 0.6% |
| Other values (117) | 6181 | 25.8% |
| (Missing) | 16134 | 67.4% |
| Value | Count | Frequency (%) |
| 52.8 | 6 | < 0.1% |
| 56.5 | 11 | < 0.1% |
| 57.5 | 1 | < 0.1% |
| 57.8 | 8 | < 0.1% |
| 58.2 | 3 | < 0.1% |
| 58.5 | 16 | 0.1% |
| 58.8 | 3 | < 0.1% |
| 59 | 8 | < 0.1% |
| 59.2 | 4 | < 0.1% |
| 59.5 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 91.8 | 6 | < 0.1% |
| 90.5 | 4 | < 0.1% |
| 90.2 | 10 | < 0.1% |
| 89.5 | 11 | < 0.1% |
| 89 | 7 | < 0.1% |
| 88 | 21 | 0.1% |
| 87.8 | 15 | 0.1% |
| 87.5 | 43 | 0.2% |
| 87.2 | 10 | < 0.1% |
| 87 | 22 | 0.1% |
| Distinct | 127 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 16357 |
| Missing (%) | 68.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.42437864 |
| Minimum | 52.8 |
|---|---|
| Maximum | 91.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 52.8 |
|---|---|
| 5-th percentile | 64.8 |
| Q1 | 70.5 |
| median | 74.5 |
| Q3 | 78.2 |
| 95-th percentile | 84.8 |
| Maximum | 91.8 |
| Range | 39 |
| Interquartile range (IQR) | 7.7 |
Descriptive statistics
| Standard deviation | 5.937425305 |
|---|---|
| Coefficient of variation (CV) | 0.07977796273 |
| Kurtosis | 0.02034776311 |
| Mean | 74.42437864 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | -0.04461795013 |
| Sum | 562946 |
| Variance | 35.25301925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75.5 | 175 | 0.7% |
| 77 | 167 | 0.7% |
| 76 | 162 | 0.7% |
| 74.5 | 157 | 0.7% |
| 75.2 | 157 | 0.7% |
| 76.5 | 152 | 0.6% |
| 70.8 | 151 | 0.6% |
| 74.2 | 148 | 0.6% |
| 71.5 | 138 | 0.6% |
| 78.2 | 136 | 0.6% |
| Other values (117) | 6021 | 25.2% |
| (Missing) | 16357 | 68.4% |
| Value | Count | Frequency (%) |
| 52.8 | 7 | < 0.1% |
| 56.5 | 8 | < 0.1% |
| 57.5 | 4 | < 0.1% |
| 57.8 | 5 | < 0.1% |
| 58.2 | 4 | < 0.1% |
| 58.5 | 12 | 0.1% |
| 58.8 | 6 | < 0.1% |
| 59 | 10 | < 0.1% |
| 59.2 | 1 | < 0.1% |
| 59.5 | 13 | 0.1% |
| Value | Count | Frequency (%) |
| 91.8 | 5 | < 0.1% |
| 90.5 | 7 | < 0.1% |
| 90.2 | 7 | < 0.1% |
| 89.5 | 3 | < 0.1% |
| 89 | 10 | < 0.1% |
| 88 | 17 | 0.1% |
| 87.8 | 16 | 0.1% |
| 87.5 | 30 | 0.1% |
| 87.2 | 10 | < 0.1% |
| 87 | 21 | 0.1% |
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 15759 |
| Missing (%) | 65.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.88929184 |
| Minimum | 54.2 |
|---|---|
| Maximum | 93.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 54.2 |
|---|---|
| 5-th percentile | 65 |
| Q1 | 72.5 |
| median | 76.2 |
| Q3 | 79.5 |
| 95-th percentile | 86 |
| Maximum | 93.2 |
| Range | 39 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.053109555 |
|---|---|
| Coefficient of variation (CV) | 0.07976236711 |
| Kurtosis | 0.2554212993 |
| Mean | 75.88929184 |
| Median Absolute Deviation (MAD) | 3.4 |
| Skewness | -0.2912718461 |
| Sum | 619408.4 |
| Variance | 36.64013529 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76.2 | 243 | 1.0% |
| 76.8 | 207 | 0.9% |
| 75 | 192 | 0.8% |
| 74.8 | 189 | 0.8% |
| 78.2 | 188 | 0.8% |
| 78.5 | 182 | 0.8% |
| 75.2 | 169 | 0.7% |
| 78 | 164 | 0.7% |
| 79.2 | 158 | 0.7% |
| 77.2 | 157 | 0.7% |
| Other values (124) | 6313 | 26.4% |
| (Missing) | 15759 | 65.9% |
| Value | Count | Frequency (%) |
| 54.2 | 2 | < 0.1% |
| 55.5 | 7 | < 0.1% |
| 56 | 1 | < 0.1% |
| 56.8 | 2 | < 0.1% |
| 57.2 | 7 | < 0.1% |
| 57.5 | 12 | 0.1% |
| 57.8 | 8 | < 0.1% |
| 58 | 8 | < 0.1% |
| 58.2 | 3 | < 0.1% |
| 58.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 93.2 | 7 | < 0.1% |
| 92 | 7 | < 0.1% |
| 91.2 | 4 | < 0.1% |
| 89.8 | 7 | < 0.1% |
| 89.5 | 14 | 0.1% |
| 89.2 | 9 | < 0.1% |
| 89 | 19 | 0.1% |
| 88.8 | 4 | < 0.1% |
| 88.5 | 24 | 0.1% |
| 88.2 | 17 | 0.1% |
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 15942 |
| Missing (%) | 66.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.25914275 |
| Minimum | 54.2 |
|---|---|
| Maximum | 93.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 54.2 |
|---|---|
| 5-th percentile | 63.8 |
| Q1 | 71.8 |
| median | 75.5 |
| Q3 | 79 |
| 95-th percentile | 85.5 |
| Maximum | 93.2 |
| Range | 39 |
| Interquartile range (IQR) | 7.2 |
Descriptive statistics
| Standard deviation | 6.124573345 |
|---|---|
| Coefficient of variation (CV) | 0.0813797915 |
| Kurtosis | 0.188041849 |
| Mean | 75.25914275 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | -0.2751545753 |
| Sum | 600492.7 |
| Variance | 37.51039866 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76.2 | 208 | 0.9% |
| 75 | 196 | 0.8% |
| 74.8 | 185 | 0.8% |
| 78 | 169 | 0.7% |
| 76.8 | 169 | 0.7% |
| 78.5 | 168 | 0.7% |
| 77.2 | 166 | 0.7% |
| 78.2 | 165 | 0.7% |
| 74.5 | 156 | 0.7% |
| 75.2 | 155 | 0.6% |
| Other values (124) | 6242 | 26.1% |
| (Missing) | 15942 | 66.6% |
| Value | Count | Frequency (%) |
| 54.2 | 10 | < 0.1% |
| 55.5 | 6 | < 0.1% |
| 56 | 2 | < 0.1% |
| 56.8 | 3 | < 0.1% |
| 57.2 | 8 | < 0.1% |
| 57.5 | 7 | < 0.1% |
| 57.8 | 6 | < 0.1% |
| 58 | 6 | < 0.1% |
| 58.2 | 6 | < 0.1% |
| 58.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 93.2 | 3 | < 0.1% |
| 92 | 5 | < 0.1% |
| 91.2 | 7 | < 0.1% |
| 89.8 | 10 | < 0.1% |
| 89.5 | 8 | < 0.1% |
| 89.2 | 5 | < 0.1% |
| 89 | 15 | 0.1% |
| 88.8 | 2 | < 0.1% |
| 88.5 | 25 | 0.1% |
| 88.2 | 8 | < 0.1% |
| Distinct | 103 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 15411 |
| Missing (%) | 64.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.81874266 |
| Minimum | 53.3 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 53.3 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 71.7 |
| median | 75.7 |
| Q3 | 80 |
| 95-th percentile | 86.7 |
| Maximum | 93 |
| Range | 39.7 |
| Interquartile range (IQR) | 8.3 |
Descriptive statistics
| Standard deviation | 6.26841591 |
|---|---|
| Coefficient of variation (CV) | 0.08267633689 |
| Kurtosis | -0.05510707391 |
| Mean | 75.81874266 |
| Median Absolute Deviation (MAD) | 4.3 |
| Skewness | 0.01423465344 |
| Sum | 645217.5 |
| Variance | 39.29303802 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76.3 | 229 | 1.0% |
| 77.7 | 223 | 0.9% |
| 72.7 | 218 | 0.9% |
| 76.7 | 218 | 0.9% |
| 71.3 | 218 | 0.9% |
| 72.3 | 215 | 0.9% |
| 74.7 | 197 | 0.8% |
| 73.3 | 194 | 0.8% |
| 73.7 | 184 | 0.8% |
| 75.7 | 174 | 0.7% |
| Other values (93) | 6440 | 26.9% |
| (Missing) | 15411 | 64.4% |
| Value | Count | Frequency (%) |
| 53.3 | 4 | < 0.1% |
| 55 | 3 | < 0.1% |
| 57.7 | 6 | < 0.1% |
| 58 | 7 | < 0.1% |
| 58.3 | 4 | < 0.1% |
| 59 | 12 | 0.1% |
| 59.3 | 3 | < 0.1% |
| 59.7 | 23 | 0.1% |
| 60 | 14 | 0.1% |
| 60.3 | 18 | 0.1% |
| Value | Count | Frequency (%) |
| 93 | 13 | 0.1% |
| 92.7 | 7 | < 0.1% |
| 92.3 | 13 | 0.1% |
| 91 | 19 | 0.1% |
| 90.7 | 6 | < 0.1% |
| 90.3 | 13 | 0.1% |
| 90 | 25 | 0.1% |
| 89.3 | 34 | 0.1% |
| 89 | 12 | 0.1% |
| 88.7 | 37 | 0.2% |
| Distinct | 103 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 15609 |
| Missing (%) | 65.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.42001925 |
| Minimum | 53.3 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 53.3 |
|---|---|
| 5-th percentile | 65.7 |
| Q1 | 71.3 |
| median | 75.3 |
| Q3 | 79.7 |
| 95-th percentile | 86 |
| Maximum | 93 |
| Range | 39.7 |
| Interquartile range (IQR) | 8.4 |
Descriptive statistics
| Standard deviation | 6.201905739 |
|---|---|
| Coefficient of variation (CV) | 0.08223155869 |
| Kurtosis | -0.05994827244 |
| Mean | 75.42001925 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.006600849852 |
| Sum | 626891.2 |
| Variance | 38.4636348 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 72.3 | 238 | 1.0% |
| 71.3 | 234 | 1.0% |
| 72.7 | 213 | 0.9% |
| 76.7 | 206 | 0.9% |
| 76.3 | 203 | 0.8% |
| 77.7 | 200 | 0.8% |
| 73 | 192 | 0.8% |
| 70.7 | 184 | 0.8% |
| 75.3 | 180 | 0.8% |
| 74.7 | 178 | 0.7% |
| Other values (93) | 6284 | 26.3% |
| (Missing) | 15609 | 65.3% |
| Value | Count | Frequency (%) |
| 53.3 | 4 | < 0.1% |
| 55 | 4 | < 0.1% |
| 57.7 | 7 | < 0.1% |
| 58 | 8 | < 0.1% |
| 58.3 | 3 | < 0.1% |
| 59 | 12 | 0.1% |
| 59.3 | 5 | < 0.1% |
| 59.7 | 30 | 0.1% |
| 60 | 6 | < 0.1% |
| 60.3 | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 93 | 8 | < 0.1% |
| 92.7 | 5 | < 0.1% |
| 92.3 | 15 | 0.1% |
| 91 | 13 | 0.1% |
| 90.7 | 6 | < 0.1% |
| 90.3 | 8 | < 0.1% |
| 90 | 13 | 0.1% |
| 89.3 | 24 | 0.1% |
| 89 | 15 | 0.1% |
| 88.7 | 26 | 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
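
The calculation described above can be sketched in plain Python: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. This is only an illustration, not the code the report was generated with; the helper names `average_ranks`, `pearson_r` and `spearman_rho` are made up for this example, and giving tied values the mean of their positions is an assumption about tie handling.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]          # monotonic but nonlinear in x
rho = spearman_rho(x, y)         # perfect monotonic relationship
r = pearson_r(x, y)              # below 1 because the relation is not linear
print(rho, r)
```

On this toy input the ranks of `x` and `y` coincide, so ρ is 1 while Pearson's r stays below 1, which is the difference the paragraph describes.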
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
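
As a minimal sketch of the formula above, the following plain-Python function divides the covariance by the product of the standard deviations and then checks the invariance under changes of location and (positive) scale. The function name `pearson_r` and the sample data are illustrative and not taken from the report.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # The 1/n factors of covariance and variances cancel, so they are omitted.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]

r = pearson_r(x, y)
# Shift and rescale each variable separately (positive factors): r is unchanged.
r_transformed = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# Flipping the sign of one variable flips the sign of r.
r_flipped = pearson_r(x, [-b for b in y])
print(r, r_transformed, r_flipped)
```
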
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
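
The pair-counting definition above can be written out directly. The sketch below implements exactly that formula (often called tau-a) in plain Python; the function name `kendall_tau_a` is made up for this example, and statistical libraries commonly apply a tie correction (tau-b) instead, so their results can differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y order the pair the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it counts toward neither group.
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]
tau = kendall_tau_a(x, y)   # 7 concordant and 3 discordant pairs out of 10
print(tau)
```
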
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation for Phik is available.
First rows
| | date | year | month | day | home_team | away_team | home_team_continent | away_team_continent | home_team_fifa_rank | away_team_fifa_rank | home_team_total_fifa_points | away_team_total_fifa_points | home_team_score | away_team_score | tournament | city | country | neutral_location | shoot_out | home_team_result | home_team_goalkeeper_score | away_team_goalkeeper_score | home_team_mean_defense_score | away_team_mean_defense_score | home_team_mean_midfield_score | away_team_mean_midfield_score | home_team_mean_offense_score | away_team_mean_offense_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1993-08-08 | 1993 | 08 | 08 | bolivia | uruguay | south america | south america | 59 | 22 | 0 | 0 | 3 | 1 | fifa world cup qualification | la paz | bolivia | False | no | win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 1993-08-08 | 1993 | 08 | 08 | brazil | mexico | south america | north america | 8 | 14 | 0 | 0 | 1 | 1 | friendly | maceió | brazil | False | no | draw | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 1993-08-08 | 1993 | 08 | 08 | ecuador | venezuela | south america | south america | 35 | 94 | 0 | 0 | 5 | 0 | fifa world cup qualification | quito | ecuador | False | no | win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 1993-08-08 | 1993 | 08 | 08 | guinea | sierra leone | africa | africa | 65 | 86 | 0 | 0 | 1 | 0 | friendly | conakry | guinea | False | no | win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 1993-08-08 | 1993 | 08 | 08 | paraguay | argentina | south america | south america | 67 | 5 | 0 | 0 | 1 | 3 | fifa world cup qualification | asunción | paraguay | False | no | lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 1993-08-08 | 1993 | 08 | 08 | peru | colombia | south america | south america | 70 | 19 | 0 | 0 | 0 | 1 | fifa world cup qualification | lima | peru | False | no | lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 1993-08-08 | 1993 | 08 | 08 | zimbabwe | eswatini | africa | africa | 50 | 102 | 0 | 0 | 2 | 0 | friendly | harare | zimbabwe | False | no | win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 1993-08-09 | 1993 | 08 | 09 | guinea | sierra leone | africa | africa | 65 | 86 | 0 | 0 | 4 | 0 | friendly | conakry | guinea | False | no | win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 1993-08-11 | 1993 | 08 | 11 | faroe islands | norway | europe | europe | 111 | 9 | 0 | 0 | 0 | 7 | friendly | toftir | faroe islands | False | no | lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 1993-08-11 | 1993 | 08 | 11 | sweden | switzerland | europe | europe | 4 | 3 | 0 | 0 | 1 | 2 | friendly | borås | sweden | False | no | lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
Last rows
| | date | year | month | day | home_team | away_team | home_team_continent | away_team_continent | home_team_fifa_rank | away_team_fifa_rank | home_team_total_fifa_points | away_team_total_fifa_points | home_team_score | away_team_score | tournament | city | country | neutral_location | shoot_out | home_team_result | home_team_goalkeeper_score | away_team_goalkeeper_score | home_team_mean_defense_score | away_team_mean_defense_score | home_team_mean_midfield_score | away_team_mean_midfield_score | home_team_mean_offense_score | away_team_mean_offense_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 23911 | 2022-06-14 | 2022 | 06 | 14 | ukraine | republic of ireland | europe | europe | 27 | 47 | 1535 | 1449 | 1 | 1 | uefa nations league | łódź | poland | True | no | draw | 75.0 | 75.0 | 74.8 | 76.5 | 80.0 | 73.8 | 78.7 | 72.7 |
| 23912 | 2022-06-14 | 2022 | 06 | 14 | bosnia and herzegovina | finland | europe | europe | 59 | 57 | 1388 | 1406 | 3 | 2 | uefa nations league | zenica | bosnia and herzegovina | False | no | win | 76.0 | 83.0 | 74.2 | 70.0 | 78.0 | 73.5 | 77.0 | 72.3 |
| 23913 | 2022-06-14 | 2022 | 06 | 14 | romania | montenegro | europe | europe | 48 | 70 | 1446 | 1342 | 0 | 3 | uefa nations league | bucharest | romania | False | no | lose | 77.0 | 65.0 | 73.5 | 76.2 | 75.0 | 68.2 | 73.7 | 74.7 |
| 23914 | 2022-06-14 | 2022 | 06 | 14 | luxembourg | faroe islands | europe | europe | 94 | 124 | 1229 | 1137 | 2 | 2 | uefa nations league | luxembourg | luxembourg | False | no | draw | 69.0 | NaN | 68.5 | NaN | 69.8 | NaN | NaN | NaN |
| 23915 | 2022-06-14 | 2022 | 06 | 14 | turkey | lithuania | europe | europe | 43 | 138 | 1461 | 1092 | 2 | 0 | uefa nations league | i̇zmir | turkey | False | no | win | 79.0 | 71.0 | 78.2 | NaN | 78.2 | NaN | 76.7 | NaN |
| 23916 | 2022-06-14 | 2022 | 06 | 14 | moldova | andorra | europe | europe | 180 | 153 | 932 | 1040 | 2 | 1 | uefa nations league | chișinău | moldova | False | no | win | 65.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 23917 | 2022-06-14 | 2022 | 06 | 14 | liechtenstein | latvia | europe | europe | 192 | 135 | 895 | 1105 | 0 | 2 | uefa nations league | vaduz | liechtenstein | False | no | lose | NaN | 65.0 | NaN | NaN | NaN | NaN | NaN | NaN |
| 23918 | 2022-06-14 | 2022 | 06 | 14 | chile | ghana | south america | africa | 28 | 60 | 1526 | 1387 | 0 | 0 | kirin cup | suita | japan | True | yes | lose | 79.0 | 74.0 | 75.5 | 75.5 | 78.2 | 78.2 | 76.7 | 76.0 |
| 23919 | 2022-06-14 | 2022 | 06 | 14 | japan | tunisia | asia | africa | 23 | 35 | 1553 | 1499 | 0 | 3 | kirin cup | suita | japan | False | no | lose | 73.0 | NaN | 75.2 | 70.8 | 77.5 | 74.0 | 75.0 | 72.3 |
| 23920 | 2022-06-14 | 2022 | 06 | 14 | korea republic | egypt | asia | africa | 29 | 32 | 1519 | 1500 | 4 | 1 | friendly | seoul | korea republic | False | no | win | 75.0 | NaN | 73.0 | NaN | 73.8 | 70.8 | 80.0 | 79.3 |